AgentCPM-GUI is an on-device GUI agent built on the 8-billion-parameter MiniCPM-V. It can operate both Chinese and English applications, and its reasoning is enhanced through reinforcement fine-tuning (RFT).
Image-to-Text
Safetensors
Supports Multiple Languages